The 2nd International Workshop on Inductive Reasoning and Machine Learning for the Semantic Web Proceedings
نویسندگان
چکیده
I will describe a novel meta-learning approach to optimizingthe knowledge discovery or data mining (DM) process. This approach hasthree features that distinguish it from its predecessors. First, previousmeta-learning research has focused exclusively on improving the learningphase of the DM process. More specifically, the goal of meta-learning hastypically been to select the most appropriate algorithm and/or parame-ter settings for a given learning task. We adopt a more process-orientedapproach whereby meta-learning is applied to design choices at differentstages of the complete data mining process or workflow (hence the termmeta-mining). Second, meta-learning for algorithm or model selectionhas consisted mainly in mapping dataset properties to the observed per-formance of algorithms viewed as black boxes. While several generationsof researchers have worked intensively on characterizing datasets, littlehas been done to understand the internal mechanisms of the algorithmsused. At best, a few have considered perceptible features of algorithmslike their ease of implementation or their robustness to noise, or the in-terpretability of the models they produce. In contrast, our meta-learningapproach complements dataset descriptions with an in-depth analysisand characterization of algorithms their underlying assumptions, opti-mization goals and strategies, together with the structure and complexityof the models and patterns they generate. Third, previous meta-learningapproaches have been strictly (meta) data-driven. To make sense of theintricate relationships between tasks, data and algorithms at differentstages of the data mining process, our meta-miner relies on extensivebackground knowledge concerning knowledge discovery itself. For thisreason we have developed a data mining ontology, which defines the es-sential concepts and relations needed to represent and analyse data min-ing objects and processes. In addition, a DM knowledge base gathers as-sertions concerning data preprocessing and machine learning algorithmsas well as their implementations in several open-source software pack-ages. The DM ontology and knowledge base are domain-independent;they can be exploited in any application area to build databases de-scribing domain-specific data analysis tasks, datasets and experiments.Aside from their direct utility in their respective target domains, suchdatabases are the indispensable source of training and evaluation datafor the meta-miner. These three features together lay the groundwork forsemantic meta-mining, the process of mining DM meta-data on the basisof data mining expertise distilled in an ontology and knowledge base.
منابع مشابه
Inductive Reasoning and Machine Learning for the Semantic Web
Claudia d’Amato aNicola Fanizzi a Marko Grobelnik b, Agnieszka Lawrinowicz c and Vojtech Svatek d a Department of Computer Sciece University of Bari, Italy E-mail: {claudia.damato | nicola.fanizzi}@uniba.it b Jozef Stefan Institute, Ljubljana, Slovenia E-mail: [email protected] c Poznan University of Technology, Poland E-mail: [email protected] d University of Economic...
متن کاملInductive learning for the Semantic Web: What does it buy?
Nowadays, building ontologies is a time consuming task since they are mainly manually built. This makes hard the full realization of the Semantic Web view. In order to overcome this issue, machine learning techniques, and specifically inductive learning methods, could be fruitfully exploited for learning models from existing Web data. In this paper we survey methods for (semi-)automatically bui...
متن کاملMining the Semantic Web Statistical Learning for Next Generation Knowledge Bases
In the Semantic Web vision of the World Wide Web, content will not only be accessible to humans but will also be available in machine interpretable form as ontological knowledge bases. Ontological knowledge bases enable formal querying and reasoning and, consequently, a main research focus has been the investigation of how deductive reasoning can be utilized in ontological representations to en...
متن کاملBuilding Rules on top of Ontologies? Inductive Logic Programming can help!
Acquiring and maintaining Semantic Web rules is very demanding and can be automated though partially by applying Machine Learning algorithms. In this paper we show that the form of Machine Learning known under the name of Inductive Logic Programming (ILP) can help. In particular, we take a critical look at two ILP proposals based on knowledge representation frameworks that integrate Description...
متن کاملDL-Learner - A framework for inductive learning on the Semantic Web
In this system paper, we describe the DL-Learner framework, which supports supervised machine learning using OWL and RDF for background knowledge representation. It can be beneficial in various data and schema analysis tasks with applications in different standard machine learning scenarios, e.g. in the life sciences, as well as Semantic Web specific applications such as ontology learning and e...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010